Characterizing immune variation and diagnostic indicators of preeclampsia by single-cell RNA sequencing and machine learning

Preeclampsia is a multifactorial and heterogeneous complication of pregnancy. Here, we use single-cell RNA sequencing to dissect the involvement of circulating immune cells in preeclampsia. Our findings reveal downregulation of the immune response in lymphocyte subsets in preeclampsia, including a reduction in natural killer cells and cytotoxic gene expression and an expansion of regulatory T cells. However, activation of naïve T cell and monocyte subsets, as well as an increase in the MHC-II-mediated pathway in antigen-presenting cells, was still observed in preeclampsia. Notably, we identified key monocyte subsets in preeclampsia, with significantly increased expression of angiogenesis pathways and pro-inflammatory S100 family genes in VCAN+ monocytes and IFN+ non-classical monocytes. Furthermore, we developed four cell-type-specific machine-learning models to identify potential diagnostic indicators of preeclampsia. Collectively, our study demonstrates transcriptomic alterations of circulating immune cells and identifies immune components that could be involved in the pathophysiology of preeclampsia.



Software and code
Data collection: The DNBelab C4 system (BGI, v1.0) was used to process scRNA-seq data and generate the single-cell expression matrix.

April 2023